๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ‘๏ธ Perceptual Coding

Psychoacoustic Models, Lossy Compression, Human Perception, Audio Quality

Fast Algorithm for Moving Sound Source
arxiv.orgยท10h
๐Ÿ‘‚Psychoacoustic Coding
New Research on HEVC Video Double Encoding Detection Published in Journal of Imaging
blog.ampedsoftware.comยท39m
๐Ÿ–ผ๏ธJPEG Forensics
Spatial Audio in a Hat
hackaday.comยท1d
๐ŸŽฏTape Azimuth
๐ŸŽญ Compressing Human Faces with VAE vs VQ-VAE โ€” A Deep Dive into Autoencoder Design
dev.toยท21hยท
Discuss: DEV
๐Ÿ“ŠQuantization
NeuralMorse โ€“ Reinventing Morse Code with Neural Networks
masatohagiwara.netยท20hยท
Discuss: Hacker News
๐Ÿ“Text Compression
Why Computer Science Is No Good, Redux
cacm.acm.orgยท20h
๐ŸŽฏPerformance Proofs
The Best Earplugs
rollingstone.comยท23h
๐Ÿ‘‚Psychoacoustics
I Built a Way to Make Your Music Look as Good as it Sounds
hackernoon.comยท6h
๐Ÿ’ฟFLAC Archaeology
Real-time neural video codec โ€“ 100 FPS 1080p and 4K videos
github.comยท1dยท
Discuss: Hacker News
๐Ÿง Learned Codecs
Arduino Audio Spectrum on LED Dot Matrix 4 in 1 Display | DI
hackster.ioยท1h
๐ŸŽตGameboy Sound
Unsupervised Multi-channel Speech Dereverberation via Diffusion
arxiv.orgยท1d
๐Ÿ‘‚Psychoacoustic Coding
Investigating Gender Bias in LLM-Generated Stories via Psychological Stereotypes
arxiv.orgยท10h
๐Ÿง Intelligence Compression
IKOD: Mitigating Visual Attention Degradation in Large Vision-Language Models
arxiv.orgยท10h
๐Ÿ“ŠRate-Distortion Theory
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
arxiv.orgยท10h
๐Ÿง Learned Codecs
SpectrumFM: A New Paradigm for Spectrum Cognition
arxiv.orgยท10h
๐Ÿง Intelligence Compression
Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR
arxiv.orgยท1d
๐ŸŽตAudio ML
Context Guided Transformer Entropy Modeling for Video Compression
arxiv.orgยท1d
๐Ÿง Learned Codecs
Spatial-Frequency Aware for Object Detection in RAW Image
arxiv.orgยท1d
๐Ÿ“ŠLearned Metrics
Coherent Multimodal Reasoning with Iterative Self-Evaluation for Vision-Language Models
arxiv.orgยท10h
๐Ÿ“ŠLearned Metrics
Kitten TTS: 25MB CPU-Only, Open-Source Voice Model
algogist.comยท12hยท
Discuss: Hacker News
๐ŸŽงLearned Audio
Loading...Loading more...
AboutBlogChangelogRoadmap